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Abstract. In this article we show the relationship between the Pareto dis- 
tribution and the gamma distribution. This shows that the second one, ap- 
propriately extended, explains some anomalies that arise in the practical use 
of extreme value theory. The results are useful to certain phenomena that are 
fitted by the Pareto distribution but, at the same time, they present a devia- 
tion from this law for very large values. Two examples of data analysis with 
the new model are provided. The first one is on the influence of climate vari- 
ability on the occurrence of tropical cyclones. The second one on the analysis 
of aggregate loss distributions associated to operational risk management. 
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1. Introduction 

The extreme value theory is used by many authors to model exceedances in 
several fields such as hydrology, insurance, finance and environmental science, see 
Furlan (2010), Coles and Sparks (2006), Moscadelli (2004). However, the theory 
shows some surprises in practical applications. For instance, Dutta and Perry 

(2006) observed, in an empirical analysis of models for estimating operational risk, 
that even when Pareto distribution fit the data it may result in unrealistic capital 
estimates (sometimes more than 100% of the asset size), see also Degen, et al. 

(2007) . In other instances despite being well-founded the power law-distribution, 
as in Corral, et al. (2010), it may happen that it works in the central region but 
not for larger values. These challenges should motivate us to find new models that 
describe the characteristics of the data rather than limit the data so that it matches 
the characteristics of the model (Dutta and Perry, 2006). 

The peaks over threshold (PoT) method for estimating high quantiles is based on 
the Pickands-Balkema-DeHaan Theorem, see McNeil, et al. (2005) and Embrechts, 
et al. (1997). Hence, in practice, the conditional distribution of any random variable 
over a high threshold is approximated by a generalized Pareto distribution (GPD). 
This result is a mathematical solution to the question, but the practical problem 
whether the threshold is high enough still remains. 

In this paper a new statistical approach for estimating high quantiles is provided 
for non-light tails data sets. It is shown that the Pareto distribution is nested in the 
statistical model here called full-tails gamma (FTC) distribution. FTC model is 
a scale parameter family of distributions on (0, oo) closed by truncation, Hence, it 
allows us to find distributions so close to the Pareto distribution as determined by 
the data, but with greater flexibility, extending the distributions for non-light tails 
provided by GPD. With the current specialized computer programs for statistical 
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analysis is not difficult to deal with the FTG distribution, since the incomplete 
gamma function and its derivatives are now easily available, see Abramowitz and 
Stegun (1972). The work of pioneers like Chapman (1956) must be viewed in this 
way. Another approach for lighter tails is in Akinsete, et al. (2008). 

The FTG distribution is related to very old families of distributions as the Pareto 
III distribution, see Arnold (1983, pp 3) and Davis et al (1979). The gamma distri- 
bution is one of the most studied families of distributions, since Fisher (1922). For 
life theory and reliability the two-parameter right truncated gamma distribution is 
usually considered since Chapman (1956). Den Broeder (1955) considered the left 
truncated gamma distribution but with known scale parameter. Stacy (1962) intro- 
duced a three-parameter generalized gamma distribution which includes, as special 
cases, the two-parameter gamma and the two-parameter Weibull. Harter (1967) 
extends the model to a four-parameter family, by including a location parameter. 
Hedge and Dahiya (1989) obtain necessary and sufficient conditions for the exis- 
tence of the MLE of the parameters of a right truncated gamma distribution. The 
right truncated gamma with unknown origin is studied by Dixit and Phal (2005). 
For simulation of right and left truncated gamma distributions, see Philippe (1997). 
Also in physics literature the FTG distribution appears related to the power-law 
with (exponential) cut off, see Clauset et al (2009) or Sornette (2006), however, the 
models are not the same. 

In Section 2, FGT distribution is introduced, showing that the domain of pa- 
rameters includes the gamma distribution and the Pareto distribution (Theorem 
1) in the boundary. Proposition 2 provides a clear interpretation of its three pa- 
rameters (pt,9,p). The FTG distribution for a > is the left truncated gamma 
distribution relocated to the origin . The FTG distribution for a < appears as the 
full exponential model generated from a canonical statistic. Section 3 describes the 
most basic statistical properties of the FTG, as the moments generating function, 
a simulation method and the standard tools for MLE. 

In Section 4, we provide applications of the FTG exemplify that are usually fit- 
ted by Pareto distribution. The first one on the influence of climate variability and 
global warming on the occurrence of tropical cyclones, see Corral, et al. (2010). 
Here, classical goodness of fit test rejects Pareto distribution but it offers no alter- 
native to that model. The alternative is here provided by the FTG distribution. 

The second example deals with the analysis of aggregate loss distributions as- 
sociated to operational risk management, see Degen, et al. (2007). The concept 
of operational risk is founded in the Basel II accord of 1999, that has been widely 
adopted around the world as a regulatory requirement by central banks. The fo- 
cus on systemic risk, precipitated by the current crisis, has elevated operational 
risk management to greater prominence. Risk capital, under the PoT approach, 
has been calculated here with Pareto and FTG distributions. Table 3 shows that 
Pareto distribution provides unrealistic and highly unstable estimations, however, 
FTG distribution provides more realistic and much more stable risk capital estima- 
tions. 



2. The full-tails gamma distribution 

The FTG distribution is the three-parameter family of continuous probability 
distributions, with support on (0, oo), defined by a e K, 6 > 0, p > by 
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(2.1) / (x; a, 9,p) = 6{p + Ox)*" 1 exp(-(p + Qx))/T{a, p). 

where T (a, p) is the (upper) incomplete gamma function, see Abramowitz and 
Stegun (1972), 

/•DO 

(2.2) r(a,p) = / f- l e-*dt, 

J P 

in particular T (a, 0) = T (a) is the gamma function. The FTG distribution extends 
to some boundary parameters as shall seen bellow. 

If a > 0, 9 > and p = 0, the family (|2.1[) clearly extends to the probability 
density function of the gamma distribution, defined by 

(2.3) g (x; a, 9) = 9 a x a ~ 1 exp (-6x) /T (a) 

For a > the FTG is the left truncated gamma distribution relocated to the 
origin, or equivalently it is a tail of gamma distribution. Note that in this paper, 
tail is used in sense of conditional exceedances over a threshold. More exactly, 
suppose F(x) is a absolutely continuous cumulative distribution function of a non 
negative random variable, X . The probability density function of the exceedances 
of X at u > is defined by 

(2.4) f u (x) = f(x + u)/(l-F(u)) 

where f (x) = F' (x). If a > and p > 0, then (|2.1[) is the probability density 
function of the exceedances of a gamma distribution at a > 0, with a = p/9 

(2.5) q„ (x; a, 9) = 9 a (x + a)^ 1 exp (-9 (x + a)) /V {a, a 6) . 

The probability density function of the Pareto distribution, defined for x > 0, is 

(2.6) p (x; a, a) = — aa^ 1 (1 + x/a) a 1 , 

where a < and a > 0. This parameterization will be used to show that the Pareto 
distribution appears in this boundary of the FTG distribution. 

Theorem 1. Let a = p/9 > fixed in (|2.1[) and a < 0. If p tends to zero, then 
the probability density function (|2.1[) tends to the probability density function of the 
Pareto distribution \2.6\) in L 1 norm. Moreover, the convergence extends to the 
moments, provided the corresponding moments for Pareto distribution are finite. 

Proof. Observe that if p tends to 0, then 9 tends to 0, since a = p/9 > is fixed. 
Using 9 — p/a, 

f(x; a, 9, p) = p a a-\l + x/a)^ 1 exp(-p(l + x/a))/T(a, p) 

converges pointwise to the probability density function (|2.6[) . since the property 
(5.1.23) of Abramowitz and Stegun (1972), under the assumptions, 

(2.7) \im p a /T(a,p) = -a 

holds. Observe that for p small 

f(x;a,6,p) = /9( j- 1 (l+2;/CT) a - 1 exp(-p(l + a;/a))/r(a,p) 

< -2acr" 1 (l + x/a) a ~ 1 = 2p{x;a,a) 

since from the limit (|2.7| we can consider the boundedness p a /T(a,p) < —2a. 
Finally, from the dominated convergence theorem we obtain the convergence in L 1 . 



4 



JOAN DEL CASTILLO, JALILA DAOUDI AND ISABEL SERRA 



Moreover, whenever the moments of Pareto distribution are finite, the convergence 
extends to these moments. □ 

Therefore, the family (|2.1[) has the boundary parameter sets corresponding to the 
gamma distribution:{a > 0, 9 > 0, p = 0} and the Pareto distribution: {a < 0, 9 = 0, a 

Summarizing, the FTG distribution (|2 . 1 1) includes the gamma distribution, the 
truncated gamma distribution (a > 0), its extension to a<0 and the Pareto distri- 
bution, see Figure |2~T1 



Probability density functions 




FIGURE 2.1. The left shows some probability density functions in 
FTG family. The FTG with a = 2, cr = 1 and 9 = 1 corresponds 
to the tails of a gamma and the FTG for a = —0.2, a = 1 and 
9 = 0.1 is not. For the boundary parameter sets of the family, we 
consider gamma with a = 2 and 9=1 and Pareto with a = —0.2 
and cr = l. The right shows the same plot in common logarithm 
for the tail of the functions. We see exponential decay except for 
Pareto probability density function. 



Proposition 2. Let X be a random variable distributed as FTG(a,9, p), then 

(a) For A > 0, the random variable XX is distributed as FTG (a, 9/ X, p). 

(b) For any threshold, u > 0, the threshold exceedances, X u is distributed as 
FTG{a,9,p + 9u). 

Proof. The first result holds from the probability density function of XX for A > 0, 
/ (x/X; a, 9, p) /X = (9/X) (p + 9x/X) a ~ l cxp(-(p+9x/X))/T(a, p) = f(x; a, 9/X, p) 
remark that for 9 = is 

p (x/X; a, a) /X = — a(Aer) -1 (1 + xj (Xa)) a 1 = p(x; a, Act). 
And the second one is a consequence of (|2.4p . For 9 > is 



/ (x + u;a,0,p) 9(p + 9u + 9x) a 1 , , x „ ifl » ,, a a , 

1 WT~\ = ty Ta~\ C M-(P + 0u + 6x)) = f(x; a,9,p + 9u) 

1 — b (u) l(a,p + 9u) 

and for 6 = □ 
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From the last result it is clear that the FTG distribution (|2.1|) is a scale parameter 
family on (0, oo) closed by truncation, in the sense of ()2.4j) . Hence it is appropriate 
for modelling (non-light) tails of datasets in the sense Balkema-DeHaan (1974) 
and Pickands (1975), since it contains the Pareto distribution and the exponential 
distribution. The parameter j3 = 1/9 is the scale parameter and the parameter p is 
the truncation parameter. The parameter (—a) shall be interpreted in terms of the 
Pareto distribution as the weight of the tail. Then, each one of the three-parameter 
separately has a clear interpretation. 

Given a fixed, the family (|2.1j) for a < appears as the full exponential model 
generated from a canonical statistic (x, log (er + x)), see Barndorff-Nielsen (1978), 
Brown (1986), Letac (1992). 

The FTG distribution is related to the three-parameter model Pareto type III 
from Arnold (1983), caracterized by the survivor function 

(2.8) F(t) = (1 + t/(f>)- x cxp(-9t). 

Moreover, the Pareto type III has been considered as a model for survival data, 
Davis (1979). This fact is natural since the Pareto type III model is a mixture of 
two FTG distributions. In fact, Pareto type III model is a particular case of the 
six-parameter mixture model of two FTG models. 

Finally, (|2.1[) can also be seen as a weighted version of the Pareto distribution, 
with the weight w(x) = exjp(—9x), that is also known as an exponencial tilting of 
the distribution, see Barndorff-Nielsen and Cox (1994). 

3. STATISTICAL TOOLS AND MLE 

With the current specialized computer programs for statistical analysis is not dif- 
ficult to deal with the FTG distribution. The incomplete gamma function, T (a, p) 
and its derivatives are now easily available. Symbolic differentiation allows us to get 
the moments of a distribution from the moment generating function. Simulation 
and optimization algorithms are available in the same way. The work of pioneers 
like Chapman (1956) must be viewed in this way. 

The cumulative distribution function corresponding to the family (|2.ip is 

F (x; a,9,p) = l- T(a, p + 6x)/T(a, p) 

and for the Pareto distribution we have to consider the limit case, corresponding 
to P (x; a, a) = 1 - (1 + x/a) a . 

The FTG distribution has moment-generating function in the interior of the do- 
main of parameters. Hence, it is possible to calculate the moments of all orders. In 
addition it is also possible to calculate the moments of the conditional distribution 
over a threshold, by Proposition [2] For a£l,fl>0,p>0, the moment-generating 
function of the FTG distribution (|2.1j) exist and it is given by 
(3.1) 

M(t) = M(t;a,6,p) = (1 - t/9)~ a exp (-pt/6) T(a, p(l - t/9))/T(a, p), t < 9. 

For a > 0, it extends for p = and coincides with the moment generating function 
of gamma distribution M g (t) = (1 — t/9)~ a 
The cumulant generating function is given by 

K (t) = log (M(t)) = -tp/0 - a log (1 - t/ff) - log r(a, p) + log T (a, (1 - t/0) p) 
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hence, the first moments are 

E[X] = K'(0) = (a-p + p)/6 

Var[X] = K"(0) = (a + (l+p-a)p-p 2 )/9 2 

where p = e~ p p a /T(a, p). Notice that using the Proposition [5J to calculate the 
conditional expectation for any threshold fixed u > 0, is the same as to calculate 
the expectation with modified parameters 

(3.2) E[X\X>u] = (a- p + p')/9 
where p! = er^+^^p + 6u) a /r(a, p + 9u) . 

3.1. Random variates generation. Simulation methods for Pareto and gamma 
distributions are well known. Has also been well studied the simulation of truncated 
gamma distribution (|2.5p . see Philippe (1997). Hence, only the set of parameters 
{a < 0, 9 > 0, p > 0} for FTG distribution is considered here. 

A simple way to simulate the distribution is the inversion method, since the 
cumulative distribution function has an easy expression, however, it needs to use 
complex numerical processes using the incomplete gamma function. 

A simple and efficient method from numerical point of view is obtained with 
an idea from Devroye (1986) on a generalization of the rejection method. We 
emphasize the simplicity of this algorithm, since it does not require the use of the 
incomplete gamma function. 

First of all, since 1/9 is a scale parameter it is enough consider simulations for 
6 = p. That is, to simulate FTG (a, 8, p), we can first simulate FTG (a, p, p) and 
finally we apply the change of scale to the random sample. 

For 9 = p, the probability density function (|2.ip split in three terms 

(3.3) / (*; a, p, p) = {p a - l e-"/T{a, p)) ( P e~ px ) (1 + xf' 1 = cg{x)i>{x) 

where the function ip (x) = (1 + x) a ~ l is [0, l]-valued, g (x) — pe~ px is a probability 
density function easy to simulate and c is a normalization constant at least equal 
to 1. 

The rejection algorithm for this case can be rewritten as follows. Generate 
independent random variates (X, U) where X has probability density function g (x) 
and U is uniformly distributed in [0,1] until U < ip(X). This method produces a 
random variable X with probability density function fix), (Devroye, 1986). 

The following code applies the method to our case, see R Development Core 
Team (2010). 

#to generate a sample of size n of FTG(a,t,r) 
rFTG<-function(n,a,t ,r) { 
sample<-c () ; m<-0 
while (m<n) { 

x<-rexp(l ,rate=r) ;u<-runif (1) 
if (u<=(l+x)~(a-l)) sample [m+1] <-x 
m<-length (sample) } 
sample*r/t } 

3.2. Maximum likelihood estimates of the parameters. In (|2.1[) . FTG dis- 
tribution has been introduced with parameters (a,9,p), since each one separately 
has a clear interpretation. For MLE estimation it is better to use (a,cr, p), with 
dispersion parameter a — p/9, since fixed a the FTG distribution is an exponential 
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model. Hence, from Barndorff-Nielsen (1978), it is known that the maximum like- 
lihood estimator exist and it is unique. The summary of the procedure to compute 
the MLE is search the dispersion parameter and then to optimize the problem for 
others. 

Let x = {x\, ...,x n } be a of size n, the log-likelihood function for FTG distribu- 
tion is 

(3.4) 

( 1 71 71 
log r (a, p) + log (ap-<*) -^±Y, l °g( 1 + -)+-Y,( 1 + - 
i=l i=l 

To simplify, we denote 

(3.5) d = d(a, p) = logr(a, p) 
and we consider (r, s), for a fixed as 

r(x; a) = (1 + x/a) and s(x; a) = log (1 + x/a) 

which are the sufficient statistics from exponential model point of view and then 

we denote the sample means as 

n 1 n 

f(x; a) = — > (1 + Xi/a) and s(x; a) = — } log (1 + Xi/a) . 
n £ — ' n z — ' 

i=l i=l 

To simplify, we use the parameters in subscript to denote the partials derivatives 
and we omit the dependence of the parameters in these derivatives. Hence, the 
scoring is (l a , l a , l P ) and it is given by 

(3.6) l a = -n{d a - log (p) - s(x;cr)} 

(3.7) l a = -n {cr _1 - (a - l)s a + pf a } 

(3.8) l p = —n {d p - ap^ 1 + f(x;cr)} 
the observed information matrix is given by 



I Q (a,a,p) = -n 



( d aa Sfj d a p p N 

—8 a -o~ 2 - (a - l)s aa + pf aa f a 



\ d ap - p 1 r a dp P + ap 2 J 

and it can be used to compute the confidence interval for the maximum likelihood 
estimates a, a and p of the parameters a, a and p, respectively. 

To compute the MLE is convenient to solve the equation ()3.7[) for to get a using 
(a(a),p(a)) for the parameters (a, p) or, more general, to maximize the profile 
log-likelihood equation 

(3.9) 

l p (a) = -n ( logT(a(a),p(a)) + log (ap(a)- & ^ - (a(a) - l)s(x, a) + p(a)f(x, a)) 

where (a(a), p{cr)) is the only one solution of the system in (a, p) consists of the 
equations (|3.6p and (|3.8p for a fixed. Remark that, from a practical point of view, 
is convenient to consider this pair of equations to simplify the equation (|3.9p (or 
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()3.7p ) in an equation as light as possible of the sample explicitly. For instance, the 
equation (|3.9| can be simplified by 

(3.10) 

l p {a) = -n (logT(d(cr),/5(cr)) - log (p(a)a x ) - (a(a) - l)d a - p{a)d p + a(a))) 

remark that is an expression without the sample explicitly. 

A procedure to obtain the MLE in R is computing the MLE of the standardized 
sample y ~ {xi/x}i<i<m considering the initial estimates as follows. We have two 
options: to take the initial estimates as (d, 1, 9) where (d, 8) is the MLE of gamma 
model or take the initial estimates as (d, &, p) where (d, a) is the MLE of Pareto 
model and p is obtained by the relation (from the equation (|3.8[) ) 

d p - ap- 1 + 1 + cr _1 = 0. 

Finally (a,a,p) (the MLE for the sample x) is obtained using the Proposition [2j 
in fact we obtain a = a', a — a' /x and p = p' . 

Finally, it might be appropriate to consider de log-scale for a and p. R has a 
package to optimize which greatly simplify the calculation the MLE. 

4. Data Analysis 

Certain phenomena that may be fitted by Pareto distribution, or the power-law 
distribution, present a deviation from these laws for very large values. It is often 
due to the interference that produces an overall limit (a finite ocean basin or a loss 
limited to the total value of a economy) . The motivation of this work was to find a 
model to explain this fact in several cases, such as the energy of tropical cyclones 
or the calculation of regulatory capital for operational risk. . 

In the first example, Choulakian and Stephens (2001) goodness of fit test rejects 
the Pareto distribution, but no alternative is provided. We show that FTG is a 
better model fitting even the very large values. In the second example, goodness of 
fit test can not be applied, since the parameter is outside the range of parameters 
provided by their tables. However, FTG is a better model that Pareto distribution, 
providing more realistic and much more stable risk capital estimations. 

4.1. Analysis of tropical cyclones. Corral, et al. (2010) study the influence of 
climate variability and global warming through the occurrence of tropical cyclones. 
Their approach is based on the application of an estimation of released energy to 
individual tropical cyclones. We are going to compare our model with its statistical 
analysis on power-law distribution for 494 tropical cyclones occurred in the North 
Atlantic between 1966 and 2009. 

To measure the importance of the tropical cyclones it is used an estimation of 
released energy, the power dissipation index (PDI), defined by 

PDI = vfAt 
t 

where t denotes time and runs over the entire lifetime of the storm and v t is the 
maximum sustained surface wind velocity at time t (PDI units are m 3 /s 2 ). The 
PDI of the original data is between 5.38 10 8 and 2.54 10 11 . Deviations from the 
power law at small PDI values were attributed to the deliberate incompleteness 
of the records for 'no significant' storms. Their estimation only considers tropical 
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cyclones with PDI bigger than 310 9 , that is a sample of size 372 (75% of the original 
data). 

Figure 14.11 shows the fit of the power-law distribution with an empirical aprox- 
imation probability density function of the sample. Given the sample of tropical 
cyclones {x{\ for 1 < i < n with n = 494, Corral, et al. (2010) approximate 
the probability density function at points p r = 10 8+ ( r_1 " 5 for 1 < r < m where 
to = 21, for the histogram values 

_ #{ Xj : lr < Xj < l r+ i} 
n (l r+1 - l r ) 

for the intervals given by l s = 0.5 10 8+s / 5 ll 1 / 5 with 1 < s < to + 1. The goal of 
their method is to plot in common logarithm (base 10) scale for both axes, since the 
power-law probability density function in this situation corresponds to a straight 
line. The fit is done by minimum square method for a set of points {(u r , v r )}, where 
u r = log 10 p r and v r = log 10 h r . 

Our first contribution consists in fitting the FTG distribution by MLE for the 
whole sample. The FTG distribution shows a really best fit especially in the tail of 
the data, see Figure 14.11 The more rapid decay at large PDI is associated with the 
finite size of the ocean basin. That is, the storms with the largest PDI do not have 
enough room to last a longer time. The relevant thing is that FTG distribution fits 
the date even in this situation. 
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Figure 4.1. FTG distribution fits better than Pareto distribution 
the tropical cyclones data set, especially in the tail of the observa- 
tions. The plot is scaled in common logarithm for both axes. 



Theorem Q] shows that Pareto distribution is nested in FTG distribution, hence 
likelihood inference is now available. MLE of parameters and its standard devi- 
ations are shown in Table Q] for FTG distribution and Pareto distribution (with 
two parameters). The values of log-likelihood function are —667.58 for FTG case 
(truncated gamma distribution) and —680.06 for the Pareto case. 

First of all, the goodness of fit test for Pareto distribution given by Choulakian 
and Stephens (2001) rejects with p-value less than 0.001 for both statistics, W 2 = 
0.28 and A 2 = 2.4, of the method, but it offers no alternative to the model. Fi- 
nally, the likelihood ratio test can be used to find a confidence region around the 
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FTG parameters, concluding that the difference between the FTG and the Pareto 
distribution is highly significant. The p- value is 5.8 10~ 7 . 





Pareto distribution 
a a I 


FTG distribution 
a a p I 


LRT 


MLE 
s.e. 


-1.63 2.01 -680.06 
0.22 0.41 


0.28 0.09 0.02 -667.58 
0.15 0.11 0.02 


24.96 



Table 1 . MLE for FTG and Pareto distributions for tropical cy- 
clone occurred in the North Atlantic between 1966 and 2009. The 
data used corresponds to PDI over 3 10 9 , with the origin shifted 
to zero (units are 10 10 m 3 /s 2 ). This change does not afect the 
likelihood ratio test, LRT, and the a parameter. 



4.2. Analysis of aggregate loss distributions. Financial institutions use in- 
ternal and external loss data in order to compare several approaches for modelling 
aggregate loss distributions, associated to quantitative modelling of operational risk, 
see Dutta and Perry (2006), Degen, et al. (2007) and Moscadelli (2004). The data 
used for the analysis was collected by several banks participating in the survey to 
provide individual gross operational losses above a threshold, starting on 2002. The 
data was grouped by eight standardized business lines and seven event types. 

Risk capital is measured as the 99.9% percentile level of the simulated capital 
estimates for aggregate loss distributions in holding period (1 year). A loss event Li 
(also known as the loss severity) is an incident for which an entity suffers damages 
that can be measured with a monetary value. An aggregate loss over a specified 
period of time can be expressed as the sum 



TV 

(4.1) S = 5>< 

i=l 

where A is a random variable that represents the frequency of losses that occur over 
the period. As usual, here it is assumed that the Li are independent and identically 
distributed, and each Lj is independent from N, that is Poisson distributed, with 
parameter A. 

The data set used here correspond to the 40 largest losses associated with the 
business line corporate finance and the event type external fraud, observed over a 
high threshold, u. To maintain confidentiality, the data {xj} has been scaled to 
threshold zero and mean 100, according to 




The 40 exceedances, rounded to two decimal place, were: 0.07, 0.11, 0.26, 0.40, 
0.46, 0.62, 0.70, 0.75, 0.89, 1.08, 1.52, 1.64, 1.69, 2.04, 2.19, 2.52, 2.73, 3.16, 3.74, 
4.04, 4.63, 5.44, 5.86, 6.02, 10.32, 19.63, 29.13, 30.36, 30.88, 35.78, 40.07, 46.12, 
137.52, 237.05, 311.14, 314.19, 396.29, 552.48, 864.88, 891.62. 

Aggregate losses are determined mainly by the extreme values of loss events 
distribution. In this case, risk capital depends on 40 exceedances, but, to calculate 
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the 99.9% quantile, a model is required. Under the PoT approach extreme values 
are modelled with Pareto distribution, see Degen, et al. (2007) and Moscadelli 
(2004). Pickands-Balkema-DeHaan theorem justifies the approach, see McNeil, et 
al. (2005). However, this approach may result in unrealistic capital estimates, 
especially when the fitted Pareto distribution has infinite expectation. 

Since the data set has only exceedances over a threshold, the PoT method is the 
appropriate way. When all losses are recorded, Dutta and Perry (2006) use a four- 
parameter distribution, called g-and-h, to model the data. If we focus on extreme 
events of financial assets returns, both upside and downside, standard method- 
ologies also include the classical Student's t and stable Paretian distributions, see 
Rachev, et al. (2010). 





Pareto distribution 
a a I 


FTG distribution 
a a p I 


LRT 


MLE 
s.e. 


-0,45 1,38 -174,44 
0.10 0.73 


-0,20 0.65 4.3E-4 -172,37 
0.16 0.59 6.2E-4 


4,14 



Table 2. MLE for FTG and Pareto distributions of losses by ex- 
ternal fraud. FTG distribution is a better model than Pareto dis- 
tribution, from likelihood ratio test. 



Table[5]gives the MLE of parameters for Pareto and FTG distributions, as well as 
its standard deviations and log-likelihood function, for the last data set. First of all 
we observe that for Pareto distribution the parameter is in the range < (—a) < 1, 
that is, a distribution with infinite expectation. This can not be rejected with 
the goodness of fit test for Pareto distribution given by Choulakian and Stephens 
(2001), since the parameter is outside the range of parameters provided by their 
tables. However, Pareto distribution is nested in FTG distribution (Theorem [T]) 
and likelihood ratio test is 4.142, with p-value 0.042. Hence, FTG distribution is 
a more likelihood model for the data set, since Pareto distribution is outside of a 
95% confidence region for FTG distribution parameters. 

Figure 14.21 shows the empirical survival (or reliability) function and its fit given 
by Pareto and FTG distributions. The probability to exceed the maximum of the 
sample is estimated at 5, 52% for the Pareto distribution and 2.65% for the FTG 
distribution, this difference does not seem essential. However, the estimation of 
high quantiles heavily depends on the model. The 0.999 quantile is 6.95 x 10 6 for 
the Pareto distribution and 3.93 x 10 3 for the FTG distribution. Moreover, the 
difference is even greater to calculate the expected tail loss over this quantile, that 
is the expected value of a loss if a tail event does occur; it is 12970.6 for the FTG 
distribution, since (|3.2[) . and infinite for the Pareto distribution. Note that these 
quantities are measured in a monetary unit (as dollars) to calculate risk capital, 
hence a factor of 10 3 is really important. 

Risk capital has been calculated as 0.999 quantile of the aggregate losses, com- 
puted from (|4.ip . by simulating 10 5 times N loss events, where N is Poisson dis- 
tributed with parameter A = 20 and the loss events, L,, are simulated from the 
fitted Pareto and FTG distributions. Using the FTG distribution the risk capital 
is 10820.4, using Pareto distribution is 5.78 x 10 9 . If our data were in thousands 
of dollars (probably is greater) the Pareto estimation of risk capital for a bank is 
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Distribution of losses by external fraud 

• Empirical 1,0 
distribution 
Pareto 

FTG 




1000 



Figure 4.2. The FTG and the Pareto distributions fit the empir- 
ical survival function in a similar way in the range of the observed 
sample. However, the estimated high quantiles differ greatly.The 
left shows survival distributiona and the picture in the right shows 
the same plot in common logarithm for the tail of the functions. 





Pareto distribution 






FTG distribution 


sample 


a 


cr 


Risk capital 


a 


logfl 


logp 


Risk capital 


1 


-0,310 


0,367 


2,47E4 


-13 


-0,038 


-7,093 


-9,250 


10832,98 


2 


-0,373 


1,122 


3,50E4 


-11 


0,003 


-6,771 


-8,251 


9292,72 


3 


-0,410 


1,719 


5,36E4 


-10 


-0,106 


-7,341 


-7,792 


13407,05 


4 


-0,423 


1,351 


l,87E-f 


-10 


-0,057 


-6,543 


-7,460 


6934,26 


5 


-0,441 


2,195 


1.23E4 


-10 


-0,006 


-6,520 


-7,217 


7603,78 


6 


-0,460 


1,205 


2,63E4 


-09 


-0,298 


-8,039 


-8,287 


16860,11 


7 


-0,486 


1,097 


6,78E4 


08 


-0,276 


-7,313 


-7,828 


8921,12 


8 


-0,538 


1,769 


l,78E-f 


-08 


-0,360 


-7,613 


-7,444 


10997,30 


9 


-0,612 


3,923 


3,86E4 


-07 


-0,257 


-6,723 


-6,141 


6503,94 


10 


-0,763 


3,916 


1,66E4 


-06 


-0,371 


-6,113 


-5,461 


3276,98 


original 


-0,448 


1,382 


5,78E4 


-09 


-0,197 


-7,325 


-7,754 


10820,37 



Table 3. Parameter estimates and the risk capital from the 
Pareto distribution and the FTG distribution for 10 bootstrap sam- 
ples and the original data set. 



about the same order as the USA gross domestic product (that is unrealistic), see 
the last file in Table [3l 

In order to see the sample dependence of the risk capital estimate, we generated 
several bootstrap samples of the same size as the original data set. It is observed 
immediately, with a small number of samples, the instability of the risk capital 
estimates obtained with the Pareto distribution. However, the estimates obtained 
with FTG distribution are much more stable. 

Table [3] reports the parameter estimates and the risk capital from the Pareto 
distribution and the FTG distribution for 10 bootstrap samples and for the original 
data set. In all cases risk capital has been calculated in the same way. Samples were 
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selected from 100 bootstrap samples, ordered by the parameter a, choosing one out 
of 10, for more diversity. Note that only sample-2 corresponds to the truncated 
gamma distribution (|2.5p and their behaviour is not different from the rest. The 
most prominent fact is that, in addition to the unrealistic risk capital estimation 
with Pareto distribution, its estimation is highly unstable, with a factor of 10 7 . 

We must remember that just as the extreme levels of energy for the tropical 
cyclones are affected by the limits of the Earth, the economy is also finite. Hence, 
FTG distribution can be a valuable alternative to Pareto distribution on operational 
risk. 
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